Input Set : A:\19452a-7sub.app 

Output Set: N:\CRF3\02052002\I548971B.raw 

<110> APPLICANT: Liljegren, Sarah 
      Yanofsky, Martin F. 
      The Regents of the University of California 
<120> TITLE OF INVENTION: Control of Fruit Dehiscence in Arabidopsis by 
      INDEHISCENT1 Genes 
<130> FILE REFERENCE: 19452A-000700US 
<140> CURRENT APPLICATION NUMBER: US 09/548,971B 
<141> CURRENT FILING DATE: 1996-09-22 
<150> PRIOR APPLICATION NUMBER: US 60/090,649 
<151> PRIOR FILING DATE: 1998-06-25 
<150> PRIOR APPLICATION NUMBER: US 09/339,998 
<151> PRIOR FILING DATE: 1999-06-25 
<160> NUMBER OF SEQ ID NOS: 9 
<170> SOFTWARE: PatentIn Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 3856 
<212> TYPE: DNA 
<213> ORGANISM: Arabidopsis thaliana 
<220> FEATURE: 
<223> OTHER INFORMATION: INDEHISCENT1 (IND1) genomic 
<220> FEATURE: 
<221> NAME/KEY: CDS 
<222> LOCATION: (2765)..(3361) 
<223> OTHER INFORMATION: IND1 
<400> SEQUENCE: 1 

ctctagacca tctactatcc ggttgttgac ccttaaagct tttgaagact actagaataa 60 
tgcaaatacc atatgtccat atccatcctt ttcttttgtt tgaactgaac attctaattt 120 
tgtaaaagaa aaaaccttat gttaatatca ccgtaggcaa aaaaaatatc tcatcatatt 180 
aaatttttat tataagatta tacattctct cgttgtaaga gttactccaa ttgcaagtgt 240 
tgtattaact aataaaaagg acgaaagtag gaagcttata attaattgat gttgcatagt 300 
actggtatat tgttgatgaa tataacaagt atgaacatta atgcatgaaa cggggtattt 360 
tgtcttgaac tcattaaagg caatgtgaaa agaagatgtg aggtctcatt ttgaaaattt 420 
atcttctagc tttgtcgatt ttaaatctat gaaatgaacg caacatatag aaatttcatg 480 
tggacaacga catttagacg gtatcttaat tagaccgatt aattagtaat atacttatat 540 
atataattag tggtgattat aagtttactt atccacttga gaatttaaac aatgggcaat 600 
accttaatgt cgaaagaagc cgtccccact tcgtgtaatg agttatgggg gagagatcct 660 
gttaaatcgt caaataaaac aacttaagaa ctagaaattg acaccaaaaa tcataaagag 720 
aacgttgaag aagtcattta tcgtatccag ctcatatttc ctagctaaga tcaaatcaag 780 
gccgttgaaa gggcttgtaa gaaaatgtcg aagaaaccgt ggggtttaga agaaagacaa 840 
gaaatagaag aacaatgatg ttaaattgcc tattttggtg tataggagtt gtcaaaagag 900 
gagagagaga agaaaattag gtcaaaataa tgagcactaa aaatggagac atgtgttgag 960 
taactattac aagagcgact tatgcttcct tatggcaatg atatccaaac caaagtgcaa 1020 
cgctcctttt ttgccctaat ttcgtaaagt ctctctcctt cttcgtcctt aggaaaaacc 1080 






RAW SEQUENCE LISTING DATE: 02/05/2002 

PATENT APPLICATION: US/09/548,971B TIME: 14:17:10 


75 ctagaaattt aatcccttgt tcttgatctt gctttttgag taaccatgat tttgaccaca 1140 
77 cactatttct tctatctttt gtggtctata ggattttgct ttatatgtgt ttcttgtatt 1200 
79 gctccgtacg tacgtatacg aatttaaatg gttataacaa ggtttatata aactagcaca 1260 
81 aatgagtcca tgaaatttgt tagcgaaaaa ggtagaaata tattgagtct ttaaacggca 1320 
83 atatatataa ttttgctgca aaacttagct ttaatcatga tctaatgata ttttctttaa 1380 
85 tttcctttgc caaattaatc acatgcacgg atttttggca agttatgtgt cgaattcttc 1440 
87 cattcacaca acactaaact taattagaac tctaggaaat attttaaaat gacaacttta 1500 
89 tcgaaaaaaa tttagttatg aaaacaattc cagaattaaa catgagctat ataatttaag 1560 
91 ataaaatgaa gtaatattga tatgtatgta ataacatatc tgattgcggt aaaaaaaaac 1620 
93 atatctgatt aaattgttca tgcaggccca tgtcactatg atgtcatcac gtttttattt 1680 
95 tcacaataac taatatatat tcaaaaaaat agttttgtca gattaaattt tttttggtgg 1740 
97 tcagctttct ccaacctact aaactagttt ggaatgttct cttctttatt tttctttttc 1800 
99 ttgatttctt atgtttttta tttatggaat tttaagacgg attgtttagg tcgtttctct 1860 
101 cttttcttgt tttctaaagt tacttttgta aactcatctc ctcccaatta gacagtcaat 1920 
103 catatagtta tcttttaata tatgtctagt tgataaaaaa aatgaaaaaa tactggtggt 1980 
105 agttctacta atgtttgtgt aaaaaatctg atattatgaa tctaatcaat ttctttgatc 2040 
107 gtataatgtg ggttaaattt agtaattttt tacataaata agaactgtaa tgttgatgta 2100 
109 tattggggaa tcagtatatt agcttgggta actatacttc tggaaatact tgaagattta 2160 
111 actatttgca aaattataat ttagtcccga aaaatacaga cgacgggaca cgacaacata 2220 
113 taagcaggtt tgaatcttgg aaaattttgt atacataacc tatataaata ctaatgttct 2280 
115 ggttgggttc aaaagccttt tcaaaagttc cattttttaa attcaaggac attttacata 2340 
117 ggaaataagt tgagtcataa aaaataatgg ttattttgta aggttttttt tttgattaaa 2400 
119 acgcacatat taagaagtta gttttttttc actaccaaat atcaattaat ttaaaaccat 2460 
121 gcaaccattc ataaaacaat actattaaag aatataaata atcacaaaat attaaataca 2520 
123 cttaaaattt acatataaat ttacaaaaca tctaattaat tgaaacagaa aggaaaaggt 2580 
125 aaaatatatc ataaaatgag acatatatcc tataaaaaaa aaatgaggca tatgaagtaa 2640 
127 ataataagag acatgcatgt aagcattcgg ttaattaatc gagtcaaaga tatatatcag 2700 
129 taaatacata tgtgtatatt tctggaaaaa gaatatatat attgagaaat aagaaaagat 2760 

131 gaaa atg gaa aat ggt atg tat aaa aag aaa gga gtg tgc gac tct tgt 2809 

132 Met Glu Asn Gly Met Tyr Lys Lys Lys Gly Val Cys Asp Ser Cys 



133 1 5 10 15 

135 gtc tcg tcc aaa agc aga tcc aac cac agc ccc aaa aga agc atg atg 2857 

136 Val Ser Ser Lys Ser Arg Ser Asn His Ser Pro Lys Arg Ser Met Met 

137 20 25 30 

139 gag cct cag cct cac cat ctc ctc atg gat tgg aac aaa gct aat gat 2905 

140 Glu Pro Gln Pro His His Leu Leu Met Asp Trp Asn Lys Ala Asn Asp 

141 35 40 45 

143 ctt ctc aca caa gaa cac gca gct ttt ctc aat gat cct cac cat ctc 2953 

144 Leu Leu Thr Gln Glu His Ala Ala Phe Leu Asn Asp Pro His His Leu 

145 50 55 60 

147 atg tta gat cca cct ccc gaa acc cta att cac ttg gac gaa gac gaa 3001 

148 Met Leu Asp Pro Pro Pro Glu Thr Leu Ile His Leu Asp Glu Asp Glu 

149 65 70 75 

151 gag tac gat gaa gac atg gat gcg atg aag gag atg cag tac atg atc 3049 

152 Glu Tyr Asp Glu Asp Met Asp Ala Met Lys Glu Met Gln Tyr Met Ile 

153 80 85 90 95 

155 gcc gtc atg cag ccc gta gac atc gac cct gcc acg gtc cct aag ccg 3097 

156 Ala Val Met Gln Pro Val Asp Ile Asp Pro Ala Thr Val Pro Lys Pro 

157 100 105 110 








159 aac cgc cgt aac gta agg ata agc gac gat cct cag acg gtg gtt gct 3145 

160 Asn Arg Arg Asn Val Arg Ile Ser Asp Asp Pro Gln Thr Val Val Ala 

161 115 120 125 

163 cgt cgg cgt cgg gaa agg atc agc gag aag atc cga att ctc aag agg 3193 

164 Arg Arg Arg Arg Glu Arg Ile Ser Glu Lys Ile Arg Ile Leu Lys Arg 

165 130 135 140 

167 atc gtg cct ggt ggt gcg aag atg gac aca gct tcc atg ctc gac gaa 3241 

168 Ile Val Pro Gly Gly Ala Lys Met Asp Thr Ala Ser Met Leu Asp Glu 

169 145 150 155 

171 gcc ata cgt tac acc aag ttc ttg aaa cgg cag gtg agg att ctt cag 3289 

172 Ala Ile Arg Tyr Thr Lys Phe Leu Lys Arg Gln Val Arg Ile Leu Gln 

173 160 165 170 175 

175 cct cac tct cag att gga gct cct atg gct aac ccc tct tac ctt tgt 3337 

176 Pro His Ser Gln Ile Gly Ala Pro Met Ala Asn Pro Ser Tyr Leu Cys 

177 180 185 190 

179 tat tac cac aac tcc caa ccc tga tgaactacac agaagctcgc tagctagaca 3391 

180 Tyr Tyr His Asn Ser Gln Pro 

181 195 

183 tttggtgtca tcctctcaac ctttttcatg ttgatatatt atatatagat gcataaagat 3451 

185 tcgatccaag attgtatggg tgttttaata ttattattct aagatatatg atgtacaatt 3511 

187 gtgtaccaag tttctttatc ttgatatcat atgcataaat aattggtgaa taaaaagaag 3571 

189 atattgattg taaacaaaaa aaagaagata ttgattgtta attagggttt gatcattctg 3631 

191 tatgaaagct ttggcctgca aattaatttt cgatatatat atatatatat ggagaatata 3691 

193 tatcaaatac ttttttaatt tgactataat ttgtatcaat tatctgaatc tgatgagtgt 3751 

195 aggttatata tggattagca aaaaagaaaa caaccattat tacgcaccta cattaaaaat 3811 

197 catccaccaa agaagaaacc atcctcaaga gggttccctc tagag 3856 

200 <210> SEQ ID NO: 2 

201 <211> LENGTH: 198 

202 <212> TYPE: PRT 

203 <213> ORGANISM: Arabidopsis thaliana 

W--> 204 <220> FEATURE: 

205 <223> OTHER INFORMATION: INDEHISCENT1 (IND1) protein 

207 <400> SEQUENCE: 2 

208 Met Glu Asn Gly Met Tyr Lys Lys Lys Gly Val Cys Asp Ser Cys Val 

209 1 5 10 15 

210 Ser Ser Lys Ser Arg Ser Asn His Ser Pro Lys Arg Ser Met Met Glu 

211 20 25 30 

212 Pro Gln Pro His His Leu Leu Met Asp Trp Asn Lys Ala Asn Asp Leu 

213 35 40 45 

214 Leu Thr Gln Glu His Ala Ala Phe Leu Asn Asp Pro His His Leu Met 

215 50 55 60 

216 Leu Asp Pro Pro Pro Glu Thr Leu Ile His Leu Asp Glu Asp Glu Glu 

217 65 70 75 80 

218 Tyr Asp Glu Asp Met Asp Ala Met Lys Glu Met Gln Tyr Met Ile Ala 

219 85 90 95 

220 Val Met Gln Pro Val Asp Ile Asp Pro Ala Thr Val Pro Lys Pro Asn 

221 100 105 110 

222 Arg Arg Asn Val Arg Ile Ser Asp Asp Pro Gln Thr Val Val Ala Arg 

223 115 120 125 

224 Arg Arg Arg Glu Arg Ile Ser Glu Lys Ile Arg Ile Leu Lys Arg Ile 

225 130 135 140 

226 Val Pro Gly Gly Ala Lys Met Asp Thr Ala Ser Met Leu Asp Glu Ala 

227 145 150 155 160 

228 Ile Arg Tyr Thr Lys Phe Leu Lys Arg Gln Val Arg Ile Leu Gln Pro 

229 165 170 175 

230 His Ser Gln Ile Gly Ala Pro Met Ala Asn Pro Ser Tyr Leu Cys Tyr 

231 180 185 190 

232 Tyr His Asn Ser Gln Pro 

233 195 

236 <210> SEQ ID NO: 3 

237 <211> LENGTH: 2765 

238 <212> TYPE: DNA 

239 <213> ORGANISM: Arabidopsis thaliana 

241 <220> FEATURE: 

242 <223> OTHER INFORMATION: 5' untranslated region IND1 promoter 
244 <400> SEQUENCE: 3 

245 ctctagacca tctactatcc ggttgttgac ccttaaagct tttgaagact actagaataa 60 

246 tgcaaatacc atatgtccat atccatcctt ttcttttgtt tgaactgaac attctaattt 120 

247 tgtaaaagaa aaaaccttat gttaatatca ccgtaggcaa aaaaaatatc tcatcatatt 180 

248 aaatttttat tataagatta tacattctct cgttgtaaga gttactccaa ttgcaagtgt 240 
249 tgtattaact aataaaaagg acgaaagtag gaagcttata attaattgat gttgcatagt 300 
250 actggtatat tgttgatgaa tataacaagt atgaacatta atgcatgaaa cggggtattt 360 
251 tgtcttgaac tcattaaagg caatgtgaaa agaagatgtg aggtctcatt ttgaaaattt 420 

252 atcttctagc tttgtcgatt ttaaatctat gaaatgaacg caacatatag aaatttcatg 480 

253 tggacaacga catttagacg gtatcttaat tagaccgatt aattagtaat atacttatat 540 

254 atataattag tggtgattat aagtttactt atccacttga gaatttaaac aatgggcaat 600 

255 accttaatgt cgaaagaagc cgtccccact tcgtgtaatg agttatgggg gagagatcct 660 
256 gttaaatcgt caaataaaac aacttaagaa ctagaaattg acaccaaaaa tcataaagag 720 
257 aacgttgaag aagtcattta tcgtatccag ctcatatttc ctagctaaga tcaaatcaag 780 

258 gccgttgaaa gggcttgtaa gaaaatgtcg aagaaaccgt ggggtttaga agaaagacaa 840 

259 gaaatagaag aacaatgatg ttaaattgcc tattttggtg tataggagtt gtcaaaagag 900 

260 gagagagaga agaaaattag gtcaaaataa tgagcactaa aaatggagac atgtgttgag 960 

261 taactattac aagagcgact tatgcttcct tatggcaatg atatccaaac caaagtgcaa 1020 

262 cgctcctttt ttgccctaat ttcgtaaagt ctctctcctt cttcgtcctt aggaaaaacc 1080 

263 ctagaaattt aatcccttgt tcttgatctt gctttttgag taaccatgat tttgaccaca 1140 

264 cactatttct tctatctttt gtggtctata ggattttgct ttatatgtgt ttcttgtatt 1200 

265 gctccgtacg tacgtatacg aatttaaatg gttataacaa ggtttatata aactagcaca 1260 

266 aatgagtcca tgaaatttgt tagcgaaaaa ggtagaaata tattgagtct ttaaacggca 1320 

267 atatatataa ttttgctgca aaacttagct ttaatcatga tctaatgata ttttctttaa 1380 

268 tttcctttgc caaattaatc acatgcacgg atttttggca agttatgtgt cgaattcttc 1440 

269 cattcacaca acactaaact taattagaac tctaggaaat attttaaaat gacaacttta 1500 

270 tcgaaaaaaa tttagttatg aaaacaattc cagaattaaa catgagctat ataatttaag 1560 

271 ataaaatgaa gtaatattga tatgtatgta ataacatatc tgattgcggt aaaaaaaaac 1620 

272 atatctgatt aaattgttca tgcaggccca tgtcactatg atgtcatcac gtttttattt 1680 

273 tcacaataac taatatatat tcaaaaaaat agttttgtca gattaaattt tttttggtgg 1740 

274 tcagctttct ccaacctact aaactagttt ggaatgttct cttctttatt tttctttttc 1800 

275 ttgatttctt atgtttttta tttatggaat tttaagacgg attgtttagg tcgtttctct 1860 

276 cttttcttgt tttctaaagt tacttttgta aactcatctc ctcccaatta gacagtcaat 1920 
277 catatagtta tcttttaata tatgtctagt tgataaaaaa aatgaaaaaa tactggtggt 1980 

278 agttctacta atgtttgtgt aaaaaatctg atattatgaa tctaatcaat ttctttgatc 2040 

279 gtataatgtg ggttaaattt agtaattttt tacataaata agaactgtaa tgttgatgta 2100 

280 tattggggaa tcagtatatt agcttgggta actatacttc tggaaatact tgaagattta 2160 

281 actatttgca aaattataat ttagtcccga aaaatacaga cgacgggaca cgacaacata 2220 

282 taagcaggtt tgaatcttgg aaaattttgt atacataacc tatataaata ctaatgttct 2280 

283 ggttgggttc aaaagccttt tcaaaagttc cattttttaa attcaaggac attttacata 2340 

284 ggaaataagt tgagtcataa aaaataatgg ttattttgta aggttttttt tttgattaaa 2400 

285 acgcacatat taagaagtta gttttttttc actaccaaat atcaattaat ttaaaaccat 2460 

286 gcaaccattc ataaaacaat actattaaag aatataaata atcacaaaat attaaataca 2520 

287 cttaaaattt acatataaat ttacaaaaca tctaattaat tgaaacagaa aggaaaaggt 2580 

288 aaaatatatc ataaaatgag acatatatcc tataaaaaaa aaatgaggca tatgaagtaa 2640 

289 ataataagag acatgcatgt aagcattcgg ttaattaatc gagtcaaaga tatatatcag 2700 

290 taaatacata tgtgtatatt tctggaaaaa gaatatatat attgagaaat aagaaaagat 2760 

291 gaaaa 2765 

294 <210> SEQ ID NO: 4 

295 <211> LENGTH: 496 

296 <212> TYPE: DNA 

297 <213> ORGANISM: Arabidopsis thaliana 

299 <220> FEATURE: 

300 <223> OTHER INFORMATION: 3' untranslated region IND1 promoter 

302 <400> SEQUENCE: 4 

303 atgaactaca cagaagctcg ctagctagac atttggtgtc atcctctcaa cctttttcat 60 

304 gttgatatat tatatataga tgcataaaga ttcgatccaa gattgtatgg gtgttttaat 120 

305 attattattc taagatatat gatgtacaat tgtgtaccaa gtttctttat cttgatatca 180 

306 tatgcataaa taattggtga ataaaaagaa gatattgatt gtaaacaaaa aaaagaagat 240 

307 attgattgtt aattagggtt tgatcattct gtatgaaagc tttggcctgc aaattaattt 300 

308 tcgatatata tatatatata tggagaatat atatcaaata cttttttaat ttgactataa 360 

309 tttgtatcaa ttatctgaat ctgatgagtg taggttatat atggattagc aaaaaagaaa 420 

310 acaaccatta ttacgcacct acattaaaaa tcatccacca aagaagaaac catcctcaag 480 

311 agggttccct ctagag 496 

314 <210> SEQ ID NO: 5 

315 <211> LENGTH: 5622 

316 <212> TYPE: DNA 

317 <213> ORGANISM: Arabidopsis thaliana 

319 <220> FEATURE: 

320 <223> OTHER INFORMATION: SHATTERPROOF1 (SHP1) genomic 
322 <220> FEATURE: 

323 <221> NAME/KEY: modified_base 

324 <222> LOCATION: (935)..(941) 

325 <223> OTHER INFORMATION: n = any nucleotide 
327 <400> SEQUENCE: 5 

328 agatctgcaa cagtgaaaag agaaaacaaa atggacttga agaggttttg acaatgccag 60 

329 agataatgct tattccctaa tatgttgcca gccaagtgtc aaattggctt tttaaatatg 120 

330 gatttctgta tcagtggtca tatttgtgga tccaacgtat tcatcatcaa gttctcaagt 180 

331 ttgctttcag tgcaattcta attcacacgt ttaactttaa catgcatgtc attataatta 240 

332 cttcttcact aagacacaat acggcaaacc tttcagatta tattaatctc cataaatgaa 300 

333 ataattaacc tcataatcaa gattcaatgt ttctaaatat atatggacaa aatttacacg 360 
334 gaagattaga tacgtatatt agtagattta gtctttcgtt tgtgcgataa gattaaccac 420 
L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:204 M:283 W: Missing Blank Line separator ; <220> field identifier 
L:343 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 